Goto

Collaborating Authors

 zhang and matsen iv




Supplement for Improved Variational Bayesian Phylogenetic Inference with Normalizing Flows A Subsplit Bayesian networks D AB C AB CD ABC D ABC D AB CD D AB CD AB CD AB CD assign

Neural Information Processing Systems

Figure 1: A simple subsplit Bayesian network for a leaf set that contains 4 species A, B, C and D. This figure is adapted from Zhang and Matsen IV (2019). SBN (the one with a full and complete binary tree structure as shown in Figure 1) is good enough. The SBN framework also generalizes to unrooted trees, which are the most common type of phylogenetic trees. (Zhang and Matsen IV, 2018). Sampling from SBNs is also straightforward via ancestral sampling.



Variational phylogenetic inference with products over bipartitions

Sidrow, Evan, Bouchard-Côté, Alexandre, Elliott, Lloyd T.

arXiv.org Machine Learning

Bayesian phylogenetics requires accurate and efficient approximation of posterior distributions over trees. In this work, we develop a variational Bayesian approach for ultrametric phylogenetic trees. We present a novel variational family based on coalescent times of a single-linkage clustering and derive a closed-form density of the resulting distribution over trees. Unlike existing methods for ultrametric trees, our method performs inference over all of tree space, it does not require any Markov chain Monte Carlo subroutines, and our variational family is differentiable. Through experiments on benchmark genomic datasets and an application to SARS-CoV-2, we demonstrate that our method achieves competitive accuracy while requiring significantly fewer gradient evaluations than existing state-of-the-art techniques.


Improved Variational Bayesian Phylogenetic Inference with Normalizing Flows

Zhang, Cheng

arXiv.org Machine Learning

Variational Bayesian phylogenetic inference (VBPI) provides a promising general variational framework for efficient estimation of phylogenetic posteriors. However, the current diagonal Lognormal branch length approximation would significantly restrict the quality of the approximating distributions. In this paper, we propose a new type of VBPI, VBPI-NF, as a first step to empower phylogenetic posterior estimation with deep learning techniques. By handling the non-Euclidean branch length space of phylogenetic models with carefully designed permutation equivariant transformations, VBPI-NF uses normalizing flows to provide a rich family of flexible branch length distributions that generalize across different tree topologies. We show that VBPI-NF significantly improves upon the vanilla VBPI on a benchmark of challenging real data Bayesian phylogenetic inference problems. Further investigation also reveals that the structured parameterization in those permutation equivariant transformations can provide additional amortization benefit.